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i This issue continues’ the practice of presenting analyses, of 
\ research repdrts in clusters based upon a common focus. The first 


, group, INDIVIDUAL ANALYSES, contains critiques of reports on an 


a 


5 7 : ~ 
individual™basis. The second group, INSTRUCTJON, -contains two 


> . 
stydies dealing, with the relationship of instruction to student 


outcomes. Three studies related to developniental factors are 2 @’ 


grouped together dnder COGNITIVE DEVELOPMENT. -Finallty, the section 
anita PAAGETIAN STUDIES ingludes a slightly different approach 
‘to the chustering st studies. Thisgsection cokrudne “five related 
= studies based upon Piagetian chedcy, conducted by basically the 
* . , 


same team of researchers, and all analyzed by the same reviewerg. 
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Expanded Abstract and Analysis 


gence, *Questioning , 
Secondary School Science 


Descriptors--*Creativity, Grade 9, a1 
Techniques, *Student Characteris 


fared Japoctaty for 1.S.E. by 
N. Eldred Bingham, University. of Florida. 


Purpose 

Thé purpose of this investigation was to determine which students 
were asking what kinds of questions. Specifically, how do variations 
in intelligence and creativity affect the number and level of questions 
being asked by junior high school science students? The five major 
objectives were to determine! . - 


\ 


1. How variations in intelligénce and creativity relate to the 
total questions and/or the number of qereone at each level 
on the taxonomical hierarchy. 


2. If ‘the levels and/or the total numbér of questions adeai can 
be used to predict credtivity. 


3. How the levels and/or the total ‘ales of questions asked are" 
related to intelligence. 


~ 


4. -If high, medium, and low intelligence groups and high, medium, 
and low creative groups can be differentiated by the level of 


questions asked : - 
5. If the groups associated with each sciencé teacher can be 
differentiated by the levels of questions they ask. 


The six hypotheses tested were: 


Hypothesis I - There are no significant differences in the 


' questions asked (concrete, abstract, and creative) 


due to differences in intelligence or creativity. 


Hypothesis II - The levels and total number of questions asked 
cannot be used to predict creativity. 


Hypothesis III - The levels and total number of questions asked 
4 are not related to intelligence. 


Hypothesis IV - High, medium and low intelligence groups cannot 


be differentiated by the levels of ee 
asked. 


gS. ; 


. 
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Hypothesis V  —- High, medium, and low creative groups cannot be: 
' differentiated by the levels of questions asked. 


Hypothesis VI - The groups associated with each science teacher 
cannot be differentiated by the levels of questions 
asked. - ‘ : 

Rationale i 


. 


' 
” 


One of the most important and pervasive goals of educators at all 
levels is to develop skills and processes to improve both divergent and 
convergent production. - * a 


. Many have indicated that teachers can guide the thodght processes 
of students by caréfully choosing the questions they use in the class- 
room. Educators agree that the questions asked of students should 
stimulate thinking, but they also confirm that not all students are 
stimulated by the same queStions. It follows that questioning strategies 
should be developed that will aid individual students. By improving 
"question-asking" behavior of both students and teachers, it might be 
possible to afd individual students in developing their natural abil- 
ities and thinking processes. ;. : 
d ? ¥ 

‘Before developing thinking ability and questioning behavior, it is 
necessary to determine what skills and processes ‘students are now capable 
of using. 


Research Design and Procedure 


A sample of 164 junior high school, science students from two schools 
were administered an intelligence test, a creativity test, and a question 
generating experiment. High, medium, and low groups were formed for both 
intelligence and creativity. Three trained judges ornamines students’ 
questions into three categories or levels: concrete, Abstract, and crea- 
tive. The hypotheses were tested by use of multivariate analysis of 
variance, multiple regression, discriminant analysis, and multivariate 
analysis of covariance. ‘ 


Findings 


a l. There is a difference in the total number of “questions asked 
by high, medium, and low intelligence groups. The higher 
intelligence students asked the most questions while the lower 
Iq students asked the least. . 


2. There is a difference in the,total number of questions asked 
_ by high, medium, and low creative groups. The higher creative 
dents asked the most questions and the less creative students 
askéd\the least questions. 


- J ‘ -~ 

a There isa difference in the number of questions asked at each 
level'by the high, medium, and low intelli groups. The 
high dntelligence groups 2skec more qasersons t all levels; 
particularly at the concrete level. 


4. There is a difference-in the number of questions asked at each 
level between the’ high, medium and: low creative groups. The 
high creative-group asked more questions at all levels, ners 

‘ ticularly at the abstract and creative level. : 

5. ,The abstract and creative levels of questions and total number 

. of questions can be used to predfct creativity when the effects 

of intelligence are controlled. The concrete questions do not 
account for a significant amount of the variance ip predicting 
creativity. Total questions also account, for a significant 
amount of the variance ‘in the prediction of -creativity. 


6. When the effects of creativity are partialed out, neither 
concrete, abstract or creative questions account for a signifi-, 
‘7 cant amount of the variance in predicting intelligence. 


7. Creative questions account for most of the difference between 
the three levels of intelligence. 


8. High, medium and low creative groups can be differentiated by 
the levels of questions asked. When the variables of concrete, 
abstract and creative questions are considere@ simultaneously, 
they do significantly discriminate between the creativity groups. 
The univariate F-tests show that these groups are being separated 
by abstract and creative questionsr 


9. Using intelligence and,creativity as covariates, the groups of 
students associated with each science teacher can be differ- 
entiated when the three categories of questions (concrete, 
abstract and crea ) are considered simultaneously. The uni- 

_variate F-tests showed that the only level at which there was 
a significant difference was that of creative questions. 


Interpretations 


It has been shown in this study that intelligence and creativity do 
affect the level of questions that students ask in science. The effects 
that IQ and creativity had tended to be somewhat different. Intelligence 
seemed to-be more related to the convergent type of questions as repre- 
sented in this study by the concrete fategory and,creativity was related 
more to the divergent type of thinking. The divergent levels were repre- 
sented by the’ abstract and creative categories used in this research. 

It also appeared that the total number of questions asked was more 
affected by creativity than by intélligence. Before generalizing too 
far beybnd this sample and the procedures followed, this study angen be 
replicated. 
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The. results of, this study also indicatgd that acking questions of 
groups may not-be the best teachiay method, If the kinds of questions 
‘students ask are representative of the. type of thinking they themselves 
“are doing, then not all students are operating at the higher levels. 
Perhaps many students’ \qyestioning abilities can be improved by follow- 
ing the suggestions offered by Strasser (1) and Torrance (2). They have 
indicated,that being respectful of unusual questions and keeping records 
of student responses will help to improve questioning behavior. The ~ 
effect of teachers on creative question-asking, when the influences of 
creativity and intelligence are controlled, certainly indicates that 
“quest ion-asking" tfaining should be -included in teacher training pro- 
grams. . ’ ate 


ABSTRACTOR'S ANALYSIS be 
‘Even though much of what goes on in many classrooms is presenting 
something, all trué teaching is interaction between teacher and students. 
The use of questions by students or by teachers, is’ perhaps the most %& 
effective way of initiating and extending interaction or probing the 


“concepts, interests, and difficulties that may be present. One needs to 


know as much as possible about appropriate kinds of questions to ask in. 
particular situations. It should prove helpful in teaching to know as 
much as possible about the effects of creativity ahd intelligence on 
questions. The study {fs relevant to what happens in the classroom. 
Regrettably, relatively few studies have been concerned with this par- ‘. “ 
ticular ‘qroblem.. u if 

oa ‘ - wr ' 

» The use of a multivariate approach to the treatment of the data 
made possible the identification of a number of sist dene qualities of 
questions that relate to creativity, and the assessment of these factors 
as they relate to different levels of intelligence, I'm unaware of 
other studies in this particular area tyat have used this multivariate 
approach. Even though this study was done wifh students in but two 
junior high schools, the nature of the study and the size of the sample 
make the results valid and’ generalizable to other Situations. ° 


The written’ report gs adequate but it includes many more tables 

than are necessary and it is difficult to read. To quickly grasp what 
the author has been doing, one needs to turn to the summary first, -and 
then read the report from the beginning. It would’ ease the track,of 

the rdéader if the author had followed the outline of topics presented 

in these abstracts in his presentation.: As he presented the information, 
one doesn't. discover the particular hypotheses that are being tested 

until he reads the “results of hypothesis testing.' ‘ 
This study, having identified the effects of creativity and ~ 
intelligence on many types of pupils questions can serve as a base 
for many specific researchers on these factors that truly promote 
creativity. . 
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Expanded Abstract and Analysis Prepared Especially for I.S.—. by . : 7 
Victor E. ‘Lopez-Tosado and Fletcher G. Watson, Harvard University. . 


Purpose 


This study was done to: (1) investigate the change in "understanding 
‘the nature of science" among talented secondary students studying a 
physics-astronomy course during a. summer institute; (2) compare the level 
of understanding of this ‘high-ability student group with other student 
and teacher groups; and (3) investigate the correlation between "under- 


standing the nature of science" and "critical thinking" and "stience 
_ achievement." 


a 


Rationale 7 Pi 


Great emphasis has been given to the importance of learning in this 
area--thé understanding of the nature of science--hence the importance 
of measuring it. Knowledge of how students do in this area might help 
in the developing of better teaching materials and teaching methods. 


Research Nesign and Procedure 


The sample consisted of 29 juniors and seniors attending the 1972 
six week Secondary Science Training Program (SSTP) at the University 
of Lowa, Iowa City. For data gathering on the variables under study 
the Test on Understanding Science (TOUS), the Watson-Glaser Critical 
Thinking Appraisal (WGCTA), and the PSSC Test of General Course Objec- 


tives were used. Students were pre- and post-tested on.all three 
instruments. 


Variables used in the assessments were: (1) "understanding the 
nature of science," (2) physics achievement, and (3) critital thinking 
ability. Data on these yariables were collected during the SSTP and 
compared with data reported by other’ authors for groups of physics 
students, high ability students, and physics teachers, JZero-order 
correlations, multivariate correlations, and regression analysis of 
variance were used to study the relationship between the criterion 
("understanding the nature of science") and independent ("Critical 
thinking ability'’) and ("Science achievement") variates. 
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Findings 


Results of the pre- and post-test comparison for the physics- 
astronomy students of the, lowa SSTP gave a nonsignificant difference 
(level of significance of .05) in their understanding of the nature 


of science as compared by TOUS. A positive change was observed, how- 
ever. 


« : 7 a 


Comparing results of the SSTP high-ability group with TOUS means 
of physics students in general, a difference ranging between 3 and 7 
points was observed. The Iowa SSTP high-ability group obtained a mean 
of 43.2. The national norms (tentative) of 1lth and 12th grades as 
established by Cooley and Klopfer (1) have-medns about 11 points below 
the Iowa group. Similar means scores were obtained by the SSTP high- 
ability groups and other high-ability or high-IQ groups. 

Findings of Rothman (2) and Welch and Walberg (3) indicate a wide 
range of thean scores for groyps of secondary school physics teachers: . 
a range of 41-50. The Lowa SSTP group score higher in two out of five 
reported studies. 


L 


"Critical thinking ability" and "science achievement" appear to 
be moderately correlated (range of .47-.62) with "understanding ‘the 
nature of science," both for the high ability STTP group and other 
student groups. In the test for linearity, a non-significant departure 
from linearity (.2 level of significance) was established. However, 
no significant increase in the power to predict."understanding the 
nature of science" was achieved by adding "critical thinking ability" 
as a second independent variable to "science achievement" in the 
regression equation. 


s 
The author's general conclusion of the study is ", . . that 
‘understanding the nature of ‘science' is cognitively speaking, Largely | 
independent of ‘critical thinking ability' and.'achievement in science’ 
that many physics teachers have less understanding of the nature of 
science than high-ability secondary students, and that TOUS scale I 
- may not: be capable of functioning to “discriminate the level of ability 
of a substantial portion of high ability students . . .". 


concludes, after analyzing the non-significant TOUS gains 
for the lowa group, "that in order to achieve significant growth in 
TOUS scores, the specific concepts and knowledge of the nature of 
science . .. have to be taught, rather than expecting this knowledge 


to be assimilated via a kind of osmotic processes during the regular 
science activities. 


\ 


Another inference about the non-significant gain observed is that 
the instrument utilized (TOUS) may not be entirely appropriate, espec- 
fally Scale I. Eight students (30 percent of the sample) missed no 
more than two items on this scale on the pre-test, Suggesting that a 
"ceiling effect" may have occurred. 


. 


o 
The small to medium correlation coefficients = WGCTA 
and TOUS: suggest that probably other factors’ have "a great deal mere 
than critical thinking and science achievement" to do with understanding 
the nature of science. : 


~ 


t : ABSTRACTOR'S ANALYSIS \ 
The purpose of the study is of great importance and should provide 

valuable information for those engaged in science teaching and/or 

“developing science teaching materials. But the conclusions obtained, 
although they cannot be overluoked, have to be considered with caution. 
The main concern with the study is the way the sample of students was 
selected. Nine students were from Iowa and the rest (20) from other 
states. No additional information, besides being high-ability students, — 
about them or how they were selected, was given. This poses a great 
doubt on whether the generalizations made can be extrapolated to the 
general population of high-ability students. If this is the case, in 
my opinion, the author's findings need to be confirmed by other investi- 
gators. - 


Another concern which needs to be considered is the lack of infor- 
mation given by the author in regard to the validity and relfability 
measures of the different instruments utilized. Only on TOUS was this 
information readily available. ; 

Some of the data from other authors for making the comparisons 
were incomplete, i.e., they did not report standard deviations, infor- 
mation in regard to sample collections, etc. Due to these missing 
data, the author in making his analysis, rested on assumptions about 
the data that do not necessarily hold. This was unfortunate. 


It is important to note that the author was conscious of these 
limitations and pointed them out wherever they existed. 
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Purpose : , 


The purpose of the study was to identify and assess intuitive and 
analytic modes of thinking in physics, to associate these modes with 
Ausubel's theory of learning, and to estimate learning efficiency of 
Students whose approach is high or low I (intuitive), or high or low 
A (analytic). The hypétheses were concerned with the relationship 
between the A and I qeeretos and various related parameters. They 
were: 


Hypothesis I: The A dimension is more highly related to 
scholastic ability than is the I dimension. 


Hypothesis II: (A) Students rated high I will achieve at 
significantly higher level than those rated 
low I; (B) students rated high A will achieve 
at a significantly higher level than those'rated 
low A. 


Hypothesis III: (A) Students rated high I will spend less time 
in learning than those rated low I. (B) Students 
rated high A‘will spend less time in learning 
than those rated low A. 


Hypothesis IV: (A) Students rated high I will ve more efficient 
in learning than those rated low 1; (B) Students 
rated high A will be more efficient in learning 
than those rated low A, 


Rationale 


A theory of learning proposed by D. P. Ausubel (1968) was utilized. 
In this subsumption theory a "differentiated cognitive structure" is 
the prime determiner of success in learning subsequent related subject 
matter. Ausubel views cognitive structure as a hierarchical set of 
concepts (subsumers) to which new information can be associated during 
meaningful learning. When no relevant subsumers are available for 
"anchorage" or association of new information, rote learning is 
necessitated until some rudimentary relevant concept is formed. In 
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an earlier study at Cornell Universi » Ring and Novak» (1971) have 


demonstrated the importance of subsuming pomeep es in chemistry to 
i subsequent cchievenent. i 


3. ‘ 
The highly I individual, it is conjectured, would possess the 

superordinate ideas and higher level subsumers necessary to enable 
him to move across the upper levels of Figure 1 with frequent referrals 
to (and from) subordinate concepts. The highly A ‘individual, it is 
conjectured, would be very effective at regenerating the lower level, 
subordinate ideas and would therefore move primarily from the sub- 
ordinate to the superordinate concepts as shown in Figure 2. The 
analytic dimension is. thus treated as an aspect for information- 
processing ability similar to that set forth previously in a model of 
concept formation (Novak, 1965): This particular aspect of informa-_, 
tion: processing would play a very important role in such mathematically 
oriented and highly structured subjects. as physics.° 


Research Design and Procedure ; 
R Twenty-five subjects were randomly selected from the 70 auto- 
tutorial (A-T) students. Each of-the 25 participatéd in a problem 
solving interview session which was taped for subsequent analysis. 
From the analysis of the interviews, I and A ratings were assigned 
to each subject. Four groups of subjects were established according 


to extremes of the I and A ratings. These groups were then compared 
on various learning-related parameters. 


In addition to the 1 and A ratings, for each of the 25 subjects 
‘in the sample, the following data were also obtained: 


. 


a) scholastic aptitude test scores, verbal and math (SATV, 
SAT) ; 


b) achievement on major course exams and weekly quizzes; 


. 


c) weekly time spent in learning as recorded in the A-T 
center; and 


d) weekly and total learning efficiency scores (learning 
efficiency defined as achievement divided by associated 


learning time; see, later in this paper, a description of 
learning efficiency). 


Findings 


Hypothesis I: "The analytic dimension- is more highly related 


to scholastic ability than is the intuitive 
dimension " was supported. The high A, low IL 
student did not differ significantly on either 
SATV or SATM scores. The student rated as high 
. and low A did differ on both the SATV and SATM 
; scores. These data support the authors’ 
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SUPERORDINATE 
CONCEPT 


SUPERORDINATE 
CONCEPT 


Subordinate 
Concept 


High Intuitive Individual Moves Freely From One Superordinate Concept, to Another with 
Frequent Referral Primarily 70 (and less frequently FROM) Subordinate Examplars. 


Figure t. Representation of the conceptual organization in cognitive structure of athe high indt 
vidual and the relationship to cognitive funchoning 
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SUPERORDINATE 
CONCEPT 


Subordinate 
Concept 


High Analytic Individual Moves Primarily Within Subordinate Concepts and To Superordinate 
Concepts, with Referral Back to Subordinate Concepts, Thus Expanding the Superordinate 
Concepts. (Very little, if any, exchange between Superordinate-Concepts.) 


Figure 2. Representation of the conceptual organization in cognitive structure of the high indi- 
vidual and the relationship to cognitive functioning 
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7 other ree were also ‘supported: 


Hypothesis II: 


osition that SAT tests tend to assess lower $ 
eras. concepts and isolated bits of knowledge. 


A) students rated high I will contin at 
ignificantly -higher level thdn those rated 
I; (B) students rated high A will achieve 


ait a significantly higher level than those rated 


low A. 


| ’ 


Hypothesis III: (A) students rated high I will spend less time 
in learning than those rated low 1. (B) students 
rated high A will spend less time in: learning 


than those rated low A. 


4 


. 


“ft 
Hypothesis IV: (A) students rated high I will be more efficient 


’ \ in learning than those rated low 1; (B) students 
site rated high A will be more efficient in learning 


than those rated low A. 


° 


For low I, 
stant, while fr high I, 
and consistently. 


. 
4 


Interpretations : Ta tee 


« 
\a 


s 


low A students, efficiency of learning was about con- 
high A students efficiency increased gradually 


; 


‘the following general conclusions can be drawn from this study. 
It is possibte to identify consistent and reliable “individual dif- 
ferences in problem solving approach and tq categorize an individual's 
preferred mode of attack using as a basis the I and A dimensions. 
Individuals appear to be reasonably stable in their approach from 


problem to problem, particularly on the A dimension.: 


One of the c 


Sola variables relating to the approach an indi- 


[vidual uses in problem solving is the degree of differentiation of 

. his cognitive structure and the concomitant availability of subsuming 
concepts. The individual who possesses the global, superordinate conr 
cepts in a discipline (high I) and also has the ability td reconstruct 
lower level concepts when and if needed (high A) is at a significant 


advantage in terms of achievement and learning efficiency. 


. 


The individual who possesses the ability to regenerate subordinate 
concepts (high A) but. lacks the overall subsuming concept (low I) finds 
it necessary to sptnd large amounts of learning time resulting in low 


‘efficiency. 


There is some evidence to suggest a progressive facilitating 


effect of A and I ability on new learning. 


steadily increased in the ~ 1, high A group. 
for the groups were not as gbunced. 
br 
$ 
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The learning efficiency 


Increases in efficiency 


ABSTRACTOR'S ANALYSIS 


‘ This is one of ‘a gtowing number. of researches on how children 
. and youth learn. Most of these studies,. especially those dealing 
with the learning:of young children, are largely influenced by Piaget's 
work., These studies have done much to make teachers aware of what they 
. may reasonably expect of children and of the need for a wide range of 
specific experiences as a base for the development of logical abstract 
“thinking. These studiés have also made secondary and college teachers 
. awgre that a relatively high percentage of their students are thinking 
at the cqncrete level rather than the abstract level‘and of the necessity 
to adjust thelr courses to provide the essential experiences. Fur ther- 
’ more,“older students able to think abstractly in a familiar area cannot 
do so in a field‘utterly new to them. So-called concrete experiences 
in the new field are required for them to move to a logical, analytical 
way af thinking. | : 


ri 


Recently there has been an ‘ineréasing number of’ researches focusing 
upon, the learning “theory which D. P. Ausubel proposed in 1968. His 
_ theory is concerned \with the concepts one brings to a new ‘learning 
‘situation, and with the’ relation of these concepts to new instances 
available in the’ new learning situation. He holda that a "differenti-y 
ated cognitive structure" which permits, subsuming new concepts of 
subséquent, related matter into this. structure facilitates efficiency 
“in learning; that unless there ‘be this differentiated cognitive struc-: 
ture with higher order cancepts available, isolated’ experiences are 
qtickly forgotten. But if this cognitive structure of higher order 
concepts is. already present in the learner,‘then the new experience : 
. reinforces or enhances the concepts. already held. Furthermore, the 
incidental ‘experiences may be forgotten While the more inclusive 
abstract concepts are held, yet the learner can reconstruct similar | 
- specific experiences if need be. As students move to progressively * 
‘higher levels {n school, this differentiated cognitive structure 
becomes more necessary an dealing. effectively with the cémplex situ- 
ations that arise. cs j . 
By categorizing the subjects fin this.study as low I, low A;- 

low I, high A; high I, low A; and high I, high A, it was poss{ble to 
_ggin new insights into how learning takes place. 


Even though the number of subjects was limited to 23, the taped 
interviews ‘enabled categorizing them as above, and the observer showed 
a high level of agreement in ranking on both the intuitive and analy- | 
tic dimensions. The study gave valid results, the research’ was 


designed such that thé study. could be ie oa replicated by others, ~~ 


and the report was wel] written. F ee 

: There has been but cufarively little research in this particular 
' aspect of learning. . The replication of this study in other sciences 
and at other levels could greatly extend present knowledge concerning 
how learning can proceed most effectively. 


- — commend this study to any researcher concerned with improving 
teaching dnd learning. 


INSTRUCTION 


21 


17 


EJ 100 917 


Trojcak, Doris A., "Training in Programming Instruction and Student 
Achievement." Journal of Research in Science Teaching, Vol. 11, 


No. 1:39-44, 1974. 
Descriptors--Academic Achievement, Autoinstructional Aids, 
*Educational Research, *Elementary School Science, Grade 4, 
*Preservice Education, *Programed Instruction, Programed 
j Materials, *Program Evaluation, Science Education, Teacher 
Developed Materials 
. - 
Expanded Abstract and Analysis Prepared Especially for I.S.E. by“ 
David R. Stronck, Washington State University. 
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J Purpose 


The purpose of this investigation was to, determine the influence 
of five stages of instruction on the prospective teacher's ability to 
construct programmed science activities for. fourth-grade students. 


The null hypotheses tested (not stated explicitly by the researcher) 
“were the following: ; : 


~ 1 


There is no significant difference in ‘scored on a test in which ‘ 
ten objectives for teaching heat tonductions to fourth graders were - 
to be sequenced betWeen prospective teachers: 


1. who completed one stage of ‘an instructional packge me 


(a) those who completed two stages; - 
(b) those who complp-ted three stages; 
(c) those who compyeted four stages; 
(d) those who completed five stages; 
(e) those who completed no stages; 


a 


2. who completed two stages of an instructional package and 
(a) those who completed three stages; 
(b) -those who completed four stages; 
(c) those who completed five stages; 
(d) those who completed no ‘stages; 
3. who completed three stages ,of an instructional package and 
(a), those who completed four stages; 
(b) those who completed five stages; 
(c) those who completed no stages; 
4. who completed four stages of an instructional package and a5 


(a) those who completed five stages; 
(b) those who completed no stages; 
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5. who completed five stages of an instructional package and 
those who completed no stages. 
a 


A second major set of hypotheses was the following¢ 
é 


N 


There is no significant difference in scores on a 16 item multiple 

choice test on friction between fourth-grade students: 

1. who were given programmed instruction prepared’ by the 
experimental methods students and those who were not given 
programmed instruction; or i 

2. who were given programmed instruction prepared by the 
experimental methods students who spent more than 12 
hours developing their programs and those who spent, less ‘ 

‘ than 12 hours. 
For both hypotheses which he the scores on the test com- 
- pleted by the fourth graders, there are subhypothesis for each of 
the five stages of instruction by which the prospective teachers were 
prepared to construct programmed science activities. 


Rationale 


One of the most important current educational developments is 
auto-instructional programming. According to D. J. Klaus (2) in his ° 
article "The Art of Auto-Instructional Progratming," the commercially. 
availabie programmed materials will be uséd responsibly and wisely 
‘if the teachers are given training for designing some of the programmed 
materials which their students will‘use. In this study, procedures 
for training prospective teachers to construct and implement auto- 
instructional materials were developed and evaluated. The five-stage 
instructional package for the prospective teachers was based especially 
on R. M. Gagné's article "Instruction and the Conditions of Learning" 


(1) and S. Thiagarajan's Programmed Programing: Part VI, Developmental 
Testing (3). 


td 
Research Design and Procedure 


The research design used throughout this investigation was Post- 
test-Only Control Group. 


The prospective elementary school teachers were divided into 
two groups: (A) 36 science methods students who received no programmed 
instruction, and (B) 55 science methods students in the experimental 
group.’ This latter experimental group was subdivided into five treat- 
ment: groups of eleven’ students in each: (1) whose who received only 
stage one of the instructional package, (2) those who received stages 
one and two, (3) those who received stages one, two, and three, (4) ‘: 
those who received the first four stages, and (5) those who received 
all five stages. : 

23° 
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The fourth-grade students were divided into two groups: (A) 118 
fourth graders who were not given programmed instruction, and (B) 275 
fourth graders in thé experimental group. This latter experimental 
group was subdivided into five treatment groups of 55 students in each: 
(1) those who were given programmed instruction prepared by the experi- 
mental methods students described in treatment group "1" above, (2) 
those who were given programmed jnstruction prepared by the methods 
students in group "2" above, and so on for the five groups. 

a 

The participants in all samples were randomly selected. The 
entire study was done within one semester. Each of the five stages of 
the instructional package required two or three 50-minute class periods. 
The time required for constructing programs by the prospective teachers 
averaged 12 hours but never exceeded 20 hours. The average completion 
time for the instructional programs done by fourth graders was 40 


minutes. The average completion time for the test taken by the fourth 
graders was 25 minutes. 


, Duncan's Multiple. Range Test was applied to determine the signifi- 
cance of differences among mean scores of the six teacher groups on 

the sequencing of instruction test and of the six fourth-grade groups 
on the friction test. .-A second analysis of data consisted of a 2 x 5 
factorial analysis of variance to determine the effects of the amount 
of time spent in preparing programs on fourth -graders’ achievement 

from using the programs. 


Findings 


Duncan's Multiple Range Test revealed no ‘significant differences 
among the mean scores of the experimental groups. Nevertheless, all of 


the experimental groups obtained significantly higher scores than the 
control groups. 


The 2x 5 factorial analysis demonstrated a significant difference 

\ in mean achievement scores obtained by the fourth-grade students 
associated with the amount of time that the prospective teachers spent 
in preparing the instructional programs. Those fourth graders who 
used programs which were prepared with more time obtained higher scores 
on the achievement test. Moreover, significant differences were found 
indicating that increasingly higher achievement levels were associated 
with increasing amounts of training-in the five stages of the instruc- 
tional package. This last finding comes from recording the systematic 
relation between the proportion of prospective teachers who spent 
greater amounts of time developing programs and the amount of training 
they received. The amount of training was‘directly proportional to 
the amount of time given to developing programs. 


Interpretations 


The researcher concluded: "The instruction in programming was 
instrumental in facilifating the prospective teachers' abilitv to 
logically sequence objectives and that fourth gradets did learn from 
the programs they used. It was also chear that the full five stages 
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of instruction had a greater effect on developing the prospective 
teachers ability to design effective instruction than only one or two 
stages of instmiction. . . . The amount of time spent by the prospec- 
tive teacher in developing the programs was related to the achieveme 
produced and was found to be a significant factor." 


kts 


ABSTRACTOR'S ANALYSIS 


. 


The investigator has done some very interesting work in the 
development and evaluation of programmed science activities constructed 
by Prospective elementary school teachers. The yecent trend toward 
individualized instruction has made this an ited ant new topic. 

- Because her study involved the use of new instruments in a new area 
of investigation, some of the difficulties in ‘obtaining complete data 
or replicating the data with. larger samples are of minor importance 
with the basic value at this novel research. 


Her first nictiee was the inadequacy of-the tests to provide 
significant differences according to the original hypotheses. One 
test required the prospective teachers to sequence ten objectives for 
teaching heat conduction to fourth graders. A second test asked 
fourth graders to respond to multiple choice questions on friction. 
Duncan's Multiple Range Test revealed no Significant differences among 
the mean scores of the experimental groups; however, all of the experi- 
mental groups obtained significantly higher scores than the control 
groups. But these control groups provide relatively little informa- 
tion because of these rather obvious assumptions: (1) prospective 
teachers with some instruction on preparing programmed materials will 
significantly better sequence ten objectives for teaching a unit than 
will the prospective teachers without such instruction; (2) fourths 
grade students who completed a programmed unit on friction will have 
siggificantly higher achievement on a multiple choice test. on the 
topic of fraction than will those without such instruction. If the 
use of Duncan's Multiple Range Test had not demonstrated significant 
differences between the control groups and the experimental groups, 
the data would have shown the work with the prospective teachers to 
be a serious failure. 

j ‘ . 

The inveMigator noted: "The analyses suggest that further 
research should be undertaken to determine if indeed instruction in 
programming can aid prospective teachers in logically sequencing objec- 

ives and that fourth graders did learn from the programs developed." 

stated, this suggestion seems unnecessary because the study did 

eveal the basic success of the instruction in programming as a signifi- 
cant aid in helping prospective teachers to sequence objectives 
logically. But the tests used did not distinguish between the vari- 
ous stages of the instructional package. The data seem to indicate 
that only one stage is needed to achieve an adequate abilit® The 
investigator describes stage one: "This segment of instruction focused 
on a review of performance objectives and their relationship to pro- 
grammed instruction. The students were given a prototype program. . . 
{which] exemplified a sequence of prerequisites leading to problem 
solving ability according to Gagné. . . . Three 50 minute class 
periods were used to complete stage one." Although the other four 
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‘stages prqvided more inférmation on the general topic, the additional 
,hours’ did not significantly improve the ability of the prospective 

teachers to sequence objectives or the foyrth-grade students to 

achieve by using the instructional materials provided by those pro- 
‘espective .teachers. A conclusion not recognized by the investigator 

is to reduce the five stage instructional package to only ‘one stage. 

This reduction would also make the package, more attractive for use by 


other science educators.who may wish to consider the same topic in 
a methods course. : ‘ : 


The study also showed’ that those fourth’ graders who used pro- 
grams which were prepared with more time obtained higher-scores on 
the achievemerit test. This is important information., Some teachers 

‘ seem to justify careless preparation‘of instruction by assuming that. _- 
teaching is an.art which is significantly improved only by practical 
experience. But this study shows that programmed materials are 
significantly improved simply by dedicating more time to the initial 
writing. From these data the investigator could arrive at an appro- 
priate definition of the assignment in terms of hours which each 
Prospective teacher should give to constructing the instructional 
material. This topic was not developed in the article. 


The amount of time which the prospective teachers gave to 
developing programs systematically increased with the amount of train- 
ing provided in the various stages of instruction. This datum indi- 
rectly shows that the attitude of the prospective teachers toward 
developing programs improves with increased training. The investigator 

_ recommended: "New instruments tq measure the sequencing and/or pro- 
Pei emon ability of teachers and professional programmers should be 
veloped." The abstractor recommends that instruments should be 
developed also to gather data on the opinion of prospective teachers 
and children toward programmed instruction. The abstractor has used 
such instruments tq,measure the attitudes of prospective teachers at 
Washington State University. He has been discouraged by the general 
rej@ction of programmed instruction by prospective elementary school 
teachers who seem to prefer other modes of instruction. Therefore 
perhaps the most valuable contribution of the investigator's article 
is her success at increasing the: acceptance of programmed inetruction 
through her five stage package of instruction. Her study did not 
attempt to identify the attitude of the fourth graders toward pro- 
grammed instruction in comparison with other modes of instruction. 


Although the study has an appropriate research design, the 
value of the five stage package of instruction will be better analyzed 
when it is used and evaluated by other instructors of science methods 
on additional campuses. Hopefully this package will allow a greater 
introduction of programmed instruction into the elementary schools. 
Recently commercial companies have published excellent materials for 
individualized instruction. But the acceptance and appropriate use 
of these materials remain a major problem in most of our nation's 
schools and methods courses. 
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Expanded Abstract and Analysis Prepared Especially for I.S.E. by 
Paul C. Beisenherz, University of New Orleans. 


The purpose of this study was to investigate the effects of two 
sets of learning codditions (teacher-structured and student-structured) 
on student performance. Eight hypotheses were generated and tested. 


1. There is no difference in enf related scores (dependent 


variable involving the fraction of student behaviors recorded 
in the lesson-related categories of the SCAS system) between 


students in the student-structured classroom and the teacher- 
structured classroom. 


The interaction effect of learning conditions by student 
class ‘Kank is zere. 


There is no difference in the LN’scores (the fraction of 

student behaviors recorded in each of the nine L categories—- 
. the dependent variable) between students in the student- 

8tructured classroom and the teacher-structured classroom 


The interaction effect of Lenenaitg conditions by student 
class rank is zero. 


There is no difference in'the mean posttest scores on the 
TAB test (dependent variable) between students in the student- 
structured classroom and the teacher-structured classroom 


The interaction effect of learning cot@itiors by student 
‘class rank is zero. . 


There is no relationship between the amount of direct 
teacher-student interaction (during a "one-to-one" inter- 


action) and the behavior scores of the involved student in : 
‘the student-structured classroom. 


There is no relationship between the amount of direct 
teacher-student interaction and the behavior scores of the 
involved students in the teacher-structured classroom. 


Rationale 
—_ “a 


The rationale for the study was provided by the paucity of 
studies that adequately identified a cause and effect relationship 
between teacher behavior and student performance. To establish more 
clearly the relationship between various aspects of instructional 
strategies and student behaviors, the authors identified the need to 
provide a more systematic and deliberate control of extraneous fdctors. 

Cl 
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Research Design and Procedure 
‘ ~ 


Fifty-two fifth grade students at the Florida State University 
School were randomly assigned to two classrooms. Both groups were 
taught science by the same teacher using the same materials and class- 
room facilities, differing only in the patterns of teacher behavior. 
Using a listing of teacher behaviors found in Matthews’ "SCAS Class- 
room Interaction Categories--Teacher Behaviors," five categories were 
identified that characterized the teacher's pattern of behavior in 
the teacher-structured classroom. Likewise, four categories char- 
acterized the student-structured classroom teacher behavior pattern. 
By determining the fraction (Learning Conditions Index) of the above 
behaviors in each of the two patterns of behavior to the total number 
of behaviors recorded using the SCAS categories of teacher behavior,. 
a level of acceptable teacher behavior was established for each 
pattern. Having established the apprapriate levels of teacher behavior 
in the student-structured and teacher-structured classrooms in the 
first five weeks of school, observational data on student classroom 
behavior bd se collected by a team of nine observers during the 
second five Meeks of school. The behavior of six students in the 
’ student-structured and teacher-structured classroom was coded daily 
for the duration of the science period. 


Hypotheses 1 through 6 were tested using analysis of variance 
techniques. Hypotheses 7 and 8 were tested using a nonparametric 
technique, the Spearman rank correlation coefficient rho. 


Findings 

Abbreviated "source tables" with corresponding F-ratios for 
Hypotheses 1 through 4 were tested. Significant differences (at the ~ 
0.05 level) were reported and are summarized in the next section. 


Analysis of the TAB test data revealed a significant difference 
- (0.05 level) in the inquiry skills of the students in the two sets 
of learning conditions in favor of the student structured students 
(Hypothesis 5). Hypothesis 6 was not rejected. Estimates of the rho 
values revealed significant (0.05 level) relationships between the 


teacher-student interaction scores and the student behavior scores 
(Hypotheses 7 and 8). 


Interpretations 

The students exposed to-the directive teaching of the teacher- 
structured classroom performed very differently from the students 
exposed to the nondirective teaching of the student-structured class- 
room. The authors state that the data suggest that, "given a\class- 
room in which materials are made available to the student, a téacher 
who frequently : 


- 
. 


1. makes statements (including questions), which tell the 


students what pa how to do an activity; 
: 29 
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praises or evaluates students for ideas or behaviors; and 


Ze 
3. rejects. and/or discourages student behaviors; 


tends to have students who 


1. spend more class time observing the teacher; 


2. spend more class time following teacher directions regarding 
what actiyity to do and/or how the activity should be done; 


spend less class time doing activities in which no specific 
teacher directions are followed, i.e., doing an activity 
of their own design; 


spend less class time responding to teacher questions; 


spend more class time initiating (or attempting to initiate) 
interaction with the teacher and continuing self-initiated 
interaction with the teacher; 


spend less class time receiving ideas from another student 
(who is not demonstrating for the teacher), and 


7. spend less class t{me giving tdead to another student 
(not at the request of the teacher)." 


In addition, the authors state that behavior scores "show clearly 

,. that the students in the nondirective learning environment of the 
student-structured classroom functioned as effectively as, if not more 
productively than, the students in the teacher-structured classroom. 
More importantly, the behavior scores reveal that the students con- 
formed to the behavior pattern of the classroom teacher." 


The authors interpreted the student observation data and the 
scores on the TAB test as suggesting that a nondirective, nonevaluating 
learning atmosphere can be established in an elementary science class- 
room with no loss of student involvement or constructive activity, 


ABSTRACTOR'S ANALYSIS 


This study represents a serious attempt to examine an 
important topic--effects of teacher behavior on student performance. 
It has been mentioned in earlier “Abstractor's Analysis" that many 
of the questions concerning research design, procedure, and treatment 
of data are quite possibly a direct result of the constraints placed 
upon authors in terms of article length. Many of the following dif- 
ficulties encountered by the abstractor in interpretating this study 
are of this nature. 


That both groups Of students were taught by the same teacher 
raises questions concerning the teacher's ability to satisfactorily 


present both patterns of behavior. While the "Learning Conditions 
Index (LCI)" was a valuable aid in defining acceptable teacher 
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behavior, other more subtle behavior patterns might have been operating 
during the ten week treatment period. It would appear difficult for 
many teachers to demonstrate behaviors not typically part of their 
normal behavior pattern. 

The authors stated (page 161) that "observational data on student 
classroom behavior were then collected by a team of nine observers 
during the second five weeks of school. The behavior of six students 
in the student-structured and teacher-structured classroom were coded 
daily for the duration of the science period..." No estimate of 
intercoder reliability was reported. Also, information identifying 
the coders, their training, how they were employed in the classroom, 
and their effect on the classroom environment would have been helpful. 
In addition, it was not clear how the sample of six students was 
selected. Were the same six students coded daily? Were there three 
or six students selected from each classroom? No mention was made 
concerning classroom events during the treatment period. 


In identifying a profile of teacher behavior in both student- 
structured and teacher-structured classrooms (Table III), it is 
not clear how the "percentage of total class time" was determined. 
How were the incidences of the specified behavior categories con- 
verted to class time? If the dominant behavior occurring within each 
five or ten second period was recorded, then potential sampling 
problems might exist. Also, the percentage of-total class time 
devoted to the behavior category, "follows teacher directions," for 
student-structured students was reported (in Table VII) to be 0.0. 
It is interesting to specualte on the classroom interaction patterns 
in this particular classroom environment. 


Table VI contains "scores representing the total time the teacher 
interacts nonverbally with a student in a one-to-one situation." 
How was this time determined? Even if operationally defined, measure- - 
ment of this class of behaviors would appear to be quite difficult. 


Also of importance in this study is the distinction between the 
two treatment groups--teacher-structured and student-structured. 
While the individual behavior categories appear satisfactory, the 
overlapping of categories between the two groups suggests a vaguely 
defined construct that makes comparisons difficult. 


In examining the differences in teacher behavior categories 
between teacher-structured and student-structured classroom inter- 
action, it appears to the abstractor that the degree of teacher 
guidance is of some importance to the study. The studentestructured 
pattern implies less guided learning while the sieinbatriaares 
pattern suggests more highly guided learning. However, inquiry, as 
a strategy of instruction, can result from both patterns. If so, 
it appears difficult to separate inquiry from non-inquiry strategies 
using the teacher-structured and student~structured categories. 
Because the TAB test was designed to evaluate a child's ability to 
demonstrate selected inquiry behaviors under specified conditions, 
the use of studenfé scores on the TAB test as a dependent variable 
to test Hypotheses 5 and 6 could be questioned. However its use in 
this i raises interesting questions for future investigations. 
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Future research could progress in several directions. For 
example, studies could be conducted that identify behaviors students 
demonstrate when taught particular concepts using specific instruct- 
ional strategies (each strategy Operationally defined by a limited 
. number of teacher behaviors). 
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Raven, Ronald J., "A Multivariate Analysis of Task Dimensions Related 
to Science Concept Learning Difficulties in Primary Schoo] 


Children." Journal of Research in Science Teaching, Vol. 9, 
No. 3:207-212, 1972. 


Descriptors--*Concept Teaching, *Cognitive Development, 


Elementary School Science, *Learning Theories, *Scientific 
Concepts, *Tdsk Analysis 


Expanded Abstract and Analysis Prepared Especially for 1.S.E. by 
Edward L. Smith, Michigan State University. j 


a 


Purpose ! 


The problem addressed was the need for information concerning 
the difficulty levels of various conceptual tasks for use in elementary 
school curriculum development. The purpose was to "determine if 
achievement in a series of problems was affected by differences among 
task values within a dimension on which the tasks were described." A 
The following hypotheses were stated: 
1. There will be a difference among tasks that have different 
inference patterns. 


2. There will be a difference among tasks that have different 
goal objects.’ 


3. There will be a difference among tasks that have different 
percepts. ; 


Studies by Bruner (2), Inhelder and Piaget (6), and Elkind (5) 
are ‘cited as evidence of the importance of "cognitive operations," 
"content capabilities," and "perceptual abilities" respectively in 
the solution of concept tasks. The author points out that although 
substantial work hay gone into investigating these abilities, relatively 
little has been done\to identify task dimensions which affect the’ , 
level of performance concept tasks. Studies by Osler and Kofsky 
(8) by Smedslund (N.) had previously found performance to be 
affec by/varying tasky along specified dimensions. * 


; ree; dimensions used\by Smedlund were adopted for the present 
st’ 3 f 


1. / Inference pattern - “\The organizational pattern of the 
/-problem which demands \specific sets of mental operations 


for solutions" [Values\ compensation (C), transitivity (T), 
coordination (R)] 


2. Goal object - "The end result of what the pupil is instructed 
to attain" [Values: speed (C), mass (M), momentum (P)] 


3. Percept - "What one perceives in the entire problem situa- 
tion" [Values: weighted balls pushing boxes (a), interacting 
collision carts (b), moving toy cars (c), balls of clay (d)] 


The author views these task dimensions as corresponding to the three 
types of learner abilities listed above, and therefore likely to be 
useful in characterizing the difficulty levels of concept tasks. 


Although not stated, the study seems based on the following 
underlying assumptions: 


1. Concept tasks such as those employed in research on concept 


attainment and cognitive development should be includgd in 
curricula. 


/ 
2. The level of difficulty of such tasks prior to instruction 


on them should have a direct relationship with their place- 
ment within the curriculum. 


Research procedure and Design 


Sample: Twenty boys and twenty girls were randomly selected 
from each of the kindergarten, first, second, and third grades of an 
elementary school characterized as having a largely middle class 
population. 


Tasks: Eight tasks were used in the study (see Table 1). The 
tasks represent 8 of the 36 types of tasks generated by combining afl 
values of the 3 task dimensions (3 inference patterns x 3 goa jects 


x 4 percepts). The author does not indicate how these eight tasks 
were selected. 


Procedure: The eight tasks were administered in random orders 
to individual children. Each task was administered twice to each 
child. A positive score required correct answers to the criterion 
question in both instances. The magnitude of the positive scores was 
not reported. Zero was assigned when one or both questions were | 
missed. Brief descriptions & each task were presented by the author. 


Design: Each of the three task dimensions was used as an inde- 
pendent variable. Each child was given all eight tasks making this 
a repeated measures (within subjects) design for these; independent 
variables. In addition, grade and sex represent\potential between 
subject (blocking) variables. The dependent variables were the task 
scores combined according to the values of the task dimensions to 
which they correspond. e author does not report how the task 
scores were combined (e.g., mean, sum, etc.). 


In terms of the Campbell and Stanley (3) notation, this design 
comes closest to the Counterbalanced Design (No. 11). \However, the 
author does not indicate that the design was strictly counterbalanced 
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TABLE’ 1 
DIMENSIONAL VALUES DESCRIBING THE TASKS USED IN THE STUDY* 


Dimension 


Task qe Pattern Goal Object 


CSa 


S must adjust (and justify) the speed of unequally weighted 


balls to Gause similar boxes to move equal distances when 


struck. : 


S must adjust (and justify) the “mass of unequally weighted 
ballb to cause similar boxes to move equal distances when 


struck. 


. 


S must indicate (and justify) the relative number of 


bricks on two unequally wefghted collision carts which 
have traveled equal distances after interacting. 


After observing 3 toy cars moving at differing speeds 


and then establishing that a went faster than b and that 


b went faster than c; S must ,indicate (and justify) 


that a went faster than c. 


After observing 3 balls of clay of differing mass and 
then establishing that a has more clay than b, and 
that b has more than c, S must indicate (and justify) 
that a has more than c. 


After observing two pairs of collision carts inter- 
acting and establishing that a has more bricks than 
b and that b has more bricks ‘than c, 8 must indicate 
(and justify) that a has more bricks ‘than C-Thabal 


After observing two toy cars simultaneously entering 
and leaving cardboard-tubes of different lengths, S 
must indicate (and justify) which car went faster. 


Aftér observing two balls of clay, establishing 

that they had the same amount of clay, and observing 
one of them being formed into the shape of a hot 
dog, S must indicate (and justify) that they still 
have the same. amount of clay. 


compensation 


‘ 


compensation 


compensation 


transitivity 


transitivity 


transitivity. 


This table was prepared by the abstractor and represents some interpretation of the author's intent. 


weighted balls : 
pushing boxes | 


weighted balls 
pushing boxes 


interacting 
collision carts 


moving toy cars 


interacting 
collision carte 


. 


jf. 
moving toy cars, | 


balle of clay 


** This comment corrects an apparent typographical error which repeated the same requirement for the TPb and TSc tasks. - 


in the sense of equal representation of each task in each position 
(as would be the case with a Latin Square design for example). 
Furthermore, the individual task scores were combined in the analysis, 
making a Campbell and Stanley type diagram of the design used in 

the analysis inappropriate. To test the stated hypotheses, the data 


"+ were analyzed separately for each of the three task dimensions with 


the factors of grade and sex not included. The design for this 
analysis of data was therefore that illustrated in Table 2 for the 
Inference Pattern dimension. 


¢ TABLE 2 


REPEATED MEASURES DESIGN USED IN THE 
ANALYSIS OF DATA* 


= 
Factor " Inference Pattern 
(Diménsion) 
P Level Compensation Tamaee Coordination 
(Value) 
——— 
Subjects 1-160 1-160 1-160 


(CSa,CMa,CPb)** . (TSc,TMd,TPb)** (RSc, RMd) ** 


*Similar designs were used with the other two task dimensions. 


**The dependent variable was the combined-score for the tasks sharing 
same value of the task dimension as indicated. 


Analysis: The percentages of children at each grade level 
attaining a positive score on each task were determined and reported. 
Scores for tasks described by the same values of the task dimensions 
were grouped together and analyzed using analysis of variance tech- 
niques. The author reports that a multivariate analysis of variance 
was used but does not indicate what multiple dependent variables were 
employed. Planned comparison contrasts were used to test the signifi- 
cance of differences between groups of tasks for each dimension. The 
report does not specify how the contrasts were defined or what the 
reported degrees of‘freedom represent. The between subjects factors 
of grade and sex were “apparently not used in this analysis of variance. 


Findings 


Although not included in the analysis \of variance, inspection 
of the percentage data indicates that performance on all tasks improved 
consistently across grade levels. The analysis of variance indicated‘ 
that all tested contrasts represented significant differences (p.< 
-001). Thus, for the children tested, the level of performance dif- 
fered for groups of tasks representing differing values on each of 
the task dimensions. X 
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Interpretations 


No direct statement concerning the disposition of the hypothesis 
is made although-the implication is clear that they were supported. 
These results are viewed as showing "that achievement on the problems 
is a function of the dimensional nature of the tasks." The inference 
pattern result is interpreted as a result of the developmental sequence 
of the three types of operations required. The assertion is made that 
such differences would not be observed among individuals in the formal 
operational stage of development. 


~The author concludes that data on task performance should be 
viewed as a function of the configuration of the task dimensional 
values selected and not as an absolute characteristic of the value on 
any single task dimension. In particular, he states that failure to 
perform a given concept task does not necessarily imply that the ~ 
learner cannot acquire the concept. In applying this conclusion to 
the work of Inhelder and Piaget, he suggests that problems such as 
those dealing with the inclined plane, inverse square law and centri- 
fugal force might have been solved by younger children had the tasks 
been modified on one or more task dimensions. This point was not ‘ 
offered as a criticism of the work of Inhelder and Piaget, but rather 
as a caution in interpretation of their findings with respect to 
concept learning. 


The author proposes that the relative difficulty indicated for 
the values of task dimensions be used in sequencing instruction. 
Specifically, he proposes that a "multidimensional matrix of task 
values within each of the dimensions" be constructed. The matrix 
would map out a range of tasks reflecting various levels of under- 
standing of concepts. Such a matrix could be compared to a develop- 
mental scale of abilities to obtain information relevant to the place- 
ment and structure of curriculum content. 


9 ABSTRACTOR 'S ANALYSIS 


are relatively rare in science education although many such studies 
have been reported in the experimental psychology literature (1). 
The present study may thus may be as important for bringing this 
approach to the attention of science educators as for the specific 
results it reports. 


fistive of the effects on performance of varying task dimensions 


to cogmitive development. Its relation to the cognitive development 
literature can be clarified by distinguishing three types of cognitive 
development studies: 


mifetve dev article is included in @cluster of reviews related 


1. Studies describing or modeling the course of cognitive 
development. 


2* Studies attempting to accelerate or promote cognitive 
development. 
. y ® 
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3. Studies taking developmental factors into accouft in 
research on instruction on tasks not themselves viewed 
as indicative of developmental level. 


The first type of study uses observation of task performartce 
as a basis for inferring the nature of underlying structures or 
processes in terms of which the developmental level of the individual 
is characterized. The children are not given instruction and the 
tasks are not representative of learning resulting from specific 
instruction. This type includes most of Piaget's own work as well 
as the study by Smedslund to which the present article refers. 
Smedslund's stated purpose was "to determine the interrelations of 
the specific acquisitions of ability for concrete/reasoning, by 
means of a set of different test items" ana. & 

The second type of study assumes the validity of constructs 
characterizing the develcpmental level and uses task performance as 
a basis for inferring the level of individuals. Performance on such 
tasks is frequently used.as the dependent variable in experiments 
in which children receive instruction designed to advance or accelerate 
development (e.g., Linn end Thier, 1975). 


The third type of study is concerned with the relationship 
between developmental level and the effects of instruction on sail 
learning tasks. Like the second type, these studies assume the . 
validity of the constructs used to characterize developmental level 
and use performance on tasks to infer the developmental level of 
individuals. Inferences are then made about the learning patterns 
to be expected from given instruction. Frequently developmental level 
is used as a blocking variable or covariate with some measure of 
learning as the dependent: variable. In an analysis of research on 
concept learning, Voelker (12) argues that science educators should 
place greater emphasis on this type of research. He proposed the 
development of a taxonomy of science concepts to guide sych research. 


Although the present: study did not involve instruction and did 
use tasks similar to those fwequently used in the first two types of 
study, it is more apprépriately included with the third type. The 
tasks are considered t@phe potential candidates for inclusion i 
curricula designed to ‘tdach concepts. ,Raven seems to echo Vgefker)s 
recommendation by proposing the development of a matrix of 4onceptyal 
task dimensions. The ma:rix would map out important aspegts of 
potential instructional <asks. Knowledge of development €ould then 
be consulted in making decisions concerning selection and\placement 
of tasks in curricula. If coordinated, Raven's proposed rix and 
Voelker's proposed concept taxonomy might be highly complementary. 

a. § 


Two important questions arise concerning the interpretations of 
the results of the study. First, the major purpose of the study 
was to assess the validity of a dimensional representation of a set 
of concept tasks. The intended use of the task dimensions is to 
characterize many tasks, not just those in the present study. Since 
no qualification is made on the generalization of the findings to 
other tasks, the representativeness of the eight tasks selected 
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becomes very important. Are they representat:ive of the 36 categories 
gewérated by the defined dimensional values? The author does not 
indicate how the tasks were selected. If the tasks do not represent 
random selections from the values for each d:.mension, thén generaliz- 
ability to the set of 36 tasks is in doubt (‘ixed effects vs. random 
effects). There appears to be no basis at al for generalizing the 
findings beyond the range of values treated «.n the present study. 


A second question arises from the analysis of data. Do the 
obtained differences among combined scores for groups 0 
vide convincing evidence of the dimensionality of the taSks? Additional 
evidence might be obtained by examining individual tasks. Smedslund 
(11:27) argues that only when all dimensions except one are held con- 
stant can the effects of that dimension be assessed. Figure 1 indi- 
cates that in the present study, four comparisons meet this criterion. 
Three of these compare different inference patterns while controlling 
for goal object and percept. Performance on compensation was better 
than that on transitivity for momentum and percept b. This pattern 
also held for the two cases where only goal object was held const 
Coordination performance was superior to transitivity for mass an 


percept d but was almost identical to transitivity for speed and 
percept c. 


ks pro- 


Statistical tests of these differences cannot be made with’ the 
reported data. However, assuming that the two differences just 
cited are significant, the evidence for dimensionality of the tasks 
appears mixed. This result is consistent with Smedslund's. He con- 
cluded "that the generality of overt observable inference patterns 
over goal objects and percepts is quite limited" (11:27). 


The article could have been improved by inclusion of the addit- 
ional information indicated in the abstract (the positive scores 
assigned to correct responses, the method of selecting the tasks to 
be used, the method of combining scores for groups of tasks, and 
explanations of the contrasts examined, the degrees of freedom 
reported and the multiple dependent variables). In addition, a 
number of other_questions need to be answered for the reader to have 
a complete picture of the analysis of data and be able to assess its 
appropriateness: What error term was used in the analysis of variance? 
Was the analysis of variance for repeated measures employed? Was 


the reuse of the same data accounted for in the interpretation of 
the alpha level? 


Another useful addition to the article would have been precise 
definitions of the task dimensions and values. Such definitions 
would help the reader to assess the appropriateness of the tasks used 
and to identify other appropriate tasks for further research. Why, 
for example, does inferring the relative number of bricks on the 

‘ collision carts in the CPb task represent a momentum goal object? 
Also, Raven's coordination task for mass (RMd) is sometimes used as 
an indicator of ability to conserve quantity or substance (10). Is 
the coordination inference pattern similar or identical to the con- 
servation inference pattern? Elkind's analysis of the conservation 
task (4) is an example of the kind of definition required for sys- 
tematic cumulative research efforts such as that proposed by Raven 


39 


Sab 


connects scores for tasks 
differing on 1 dimension only 


1.0 ; connects scores for tasks 
Momen-(b) differing on 2 dimensions 
tum 


lower case letters refer to 
percept values 


(d) 


SS 
Compensation Transitivity Coordination 


Inference Pattern 


Figure 1 -- Graphic Comparisons Among Mean 


Task Scores 
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in the presenf article. Siegel and Hooper's collection of research 
based on Piaget's theory is an excellent source. 


Proposals for system’ research on a given topic or within a 
given framework abound / science education. For such proposals to 
get off the ground, a long range commitment by researchers is neces- 
y 4 If the proposer is not willing to make such a commitment, no 

e else is likely to either. The success of the present proposal 
for development ofya task dimension matrix will also depend on the 


development and use of precise definitions of task dimensions, values 
and tasks. 
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Purpose a 


This study sought to determine if the physical science 
unit, Interaction and Systems,!.from the Science Curriculum Improvement 
Study (SCIS) is meeting its objectives by comparing nonparticipating 
second grade children with those studying the SCIS curriculum, A long 


range purpose is to produce information to be used in guiding curricu- 
lum decisions. 


Rationale 


This is the second phase of a longitudinal (six-year) study with 
this science program and its objectives as a focus. Nearly the same 
population had been used in the first study, "An Evaluation of Certain 
Cognitive Aspects of the Material Objects Unit of the SCIS Elementary 
Science Program," Journal of Research in Science Teaching, Vol. 7, 
pp. 277-281, 1970. Allen sought in this second study to look at the 
strength of cognitive, motivational and affective factors and their 
interaction with the variables of sex and socio-economic status. The 
primary concern was with measuring children's understanding of the 
terms "Interaction" and "Evidence of Interaction," 


Research Design and Procedures 


In the initial study a random sample of 50 children for each of 
the three predetermined socio-economic levels was drawn from the popu- 
lation of first grade children in the Honolulu, Hawaii, Schools par- 
ticipating in the SCIS program. Socio-economic status was determined 
by teacher judgement and family income criteria. Each of the SCIS 
schools was matched with a non-SCIS school of approximately equivalent 
socio-economic levels and the same number of first grade children 
were randomly selected from this population. Therefore, a total of 
300 subjects participated in the first study. 


In this second study 213 of the original 300 first grade cl¥ildren, 
101 SCIS second graders and 112 non-SCIS second graders, participated. 
A fourteen item post-test was administered to these 213 second grade 
children. -The focal objectives. from the Interaction and Systems Unit 
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were: (a) to récognize changes that occur during an experiment; 
(b) to understand and use the word "interact"; (c) to understand and 
use the phrase "evidence of interaction"; (d) to use various senses 


in gathering.evidence of interaction; (e) to recognize evidence of 
interaction-at—a-distance. 


In view of these objectives the test items were divided into 
three sets. Set one dealt with favorable attitudes toward science 
and the scientific process. Set two questions focused on identifi- 
cation of interaction as evidenced by changes produced in liquid 
crystals when influenced by objects manipulated by each child. The 
third set asked the child to note evidence of interaction and recog- 
nition of changes in a burning candle system as evidence of interaction. 
A record was kept of the number of SCIS children who understood the 


terms "Interaction" and "Evidence" as reflected by their not requir- 
ing definition of these terms. 


Student responses to the fourteen items of the three sets were 
intercorrelated and factor analyzed by the principal axis method. The 
three factors were extracted and rotated to & Varimax Criterion. Factor 
one, considered a cognitive factor, consisted of seven items which ; 
dealt with recall or recognition of knowledge and the development of 
intellectual skills. Factor two, considered a motivational factor, 
included four items that required thought beyond the immediate task 
presented to the child. Factor three, identified as the affective 
factor, included items interpreted to reflect interests and attitudes. 


Findings 

The experimenter reports that recordings of the number of times 
the SCIS children understood the terms "Interact" and "Evidence" 
revealed that the children comprehended these terms in no less than 
85 percent of the cases. He also reports that these terms had to be 
defined for all items in the evaluations for the non-SCIS children. 


The average correlations between each item and its scale total 
(determined from the factor loadings) were 0.63 for the affective, 
0.60 for the cognitive and 0.78 for the motivational factors respec- 
tively. To compare the SCIS with the non-SCIS performance, the 
factor scores were calculated on a 3-way analysis of variance carried 
out for each factor. Correlations between the post-test evaluations 
and the California Test of Mental Maturity were not used as a covari- 
ate because they were too low. 


Factor one with a between treatment F ratio of 68.1 reportedly 
shows that the probability that this result could have been’ achieved 
by chance is low. Therefore, Allen concludes that the cognitive : 
performance of the SCIS children is significantly superior to that of 
the non-SCIS children with 35 percent of the test variance accounted 
for through a w2 calculation. However, the same calculation for the 
affective factor explains only 2 percent of the variance despite a 
determined statistical difference between groups for this factor. 

The researcher claims significant differences in cognitive performance 
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among the three socio-economic levels. However, shoud data were no 
given. No _ Significant difference was determined between SCIS and 
SCIS children on the motivational factor. 


? 


eee oe and Conclusions’ 

According to Allen, approximately 90 percent of the SCIS children 
tested understood by responding to questions using the terms "inter- 
action" and “evidence of interaction." Therefore, he concludes that 
the objectives for the Interaction and Systems Unit appear to be met 
satisfactorily. Honolulu, Hawaii, second grade children studying the 
SCIS program are statistically. superior to non-SCIS children in both 
cognitive and affective behavior categories when the input variables 
of science program membership, sex and socio-economic status are used 
as criterion measures. However, only in the analysis of cognitive 
scores, with 35 percent of the variance accounted for, is there a 
strong statistical association between dependent and independent 
variables. Therefore, the researcher concludes that while the SCIS 
program is producing cognitive behavior of some practical significance 
for the classroom teacher, affective and motivational performance 
appears to be no better than that of the comparison group. 


ABSTRACTOR'S ANALYSIS 


Reactions to this investigation are tempered by the understanding 
that this article describes only a segment of the total study. The . 
longitudinal nature of this research gives it a quality dimension. A 
long term look at the development of a select group ‘of subjects, their 
performance with the focus science program (SCIS) and the long term 
effect of this program has merit., The use of a comparison group adds 
power to the study. However, the design and research procedures of 
this particular segment of the research present some questions and 
shortcomings. 


One shortcoming is in the description of the sample population 
tested and the sampling procedures. The method of determining socio- 
economic status was likely influenced by teacher bias. It seems that 
family income indices would have been sufficient and more accurate 
for this determination. It is not clear why the researcher used 
metching by schools rather than by subjects. Howev@®, the use of 
matching never assures the researcher that differences between groups. 
on the post-test measure are exclusively the “result of an experimental 
variable. f , 


In addition, no information is available in the report qa the 

“ nature of the classrooms, the training of the teachers, their famil- 
iarity with the programs or the kinds of learning experiences engaged 
in by the students during the year. Neither does there seem to be a 
sufficient look at the students, their abilities and their interactions 
with the programs. One might assume that the children in the non- 

SCIS group were exposed to fewer "hands on" science experiences through- 
out the year. An even stronger bias against the non-SCIS group ‘is 
possible because of the-kinds of experiences, they did or did not have. 
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It is possible they did not have physical science experiences of the 
type engaged in by the SCIS group. Does this study show no more than 
that children who are using the terms "interaction" and “evidence” with 
regularity are more familiar with these terms? The control group 
received nothing but the outcome test, not even training in the 
vocabulary which was central to the outcome measure. Thus on the one 
hand the finding that the SCIS group does better is in one respect 
trivial. On the other hand, the fact that these students can apply 
these terms to other tasks is not trivial. However, one has to raise 
the question as to whether the difference between the performance of 
the control and experimental groups on the cognitive objectives is 

due to program differences or familiarity or lack of familiarity with 
the terminology. The use of the terms as evidence of cognitive output 
might be questioned. Although loading is evident in the factors 
tested, why and in what way is not described in the report. In all 
fairness to the investigator, this may be a consequence of the research 
reporting procedures required by the journal rather than an oversight. 


The researcher does not state any null hypotheses. For the ‘ 
analysis and the interpretation of the analyses to be clear the hypo- 
theses being tested should be stated. Multivariate hypotheses are 
stated in a particular manner. Therefore, for this study to be use- 
ful to someone interested in replicating it, their absence presents 
a limitation. Although the instrumentation used in the investigation 
is adequately described, no mention is made of the reliability and 
validity coefficients. A reader unfamiliar with the instrument is at 
a loss to determine its true value as used in the investigation. A 
major source of invalidity could lie with the absence of a regiability 
coefficient for the test instrument. If the measuring instrument is 
not reliable, a true treatment effect may well be cancelled by the 
inconsistency or fluctuation of the instrument. In other words, a 
type II error will occur more often. 


+ The investigator does not reference to any of the research that 
has been done to test theyeffectiveness of the SCIS program or, for 
that manner, any other research that would clarify the problem of 
this study or support its rationaie. However, in all fairness to 
Allen, some of this research has been reported since his study. 


Althodgh the experimenter has attempted to control for external 
validity by using random selection, questions must be raised about 
the appropriateness of the objectives selegted and the, validity of 
the test items for testing performance off these objectives. The 
affective and motivational objectives are not ones specified by the 
SCcIS er Therefore, the researcher is testing for an outcome 
not intended by the program developers. Secondly, one would have to 
doubt that the questions presented in the test instrument are suf- 
ficient criterion measures for the affective qualities or that an 
adequate number of items were included for this determination. 


Descriptions of the treatment are vague. The conditions and 
time period for the treatment and test are not described. This 
presents limitations for replication. A major source of invalidity 
‘ds possible. The results are presented in a concise manner. The 
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written description is ‘cdnaistent with the data and inferences are 
kept to a minimum and in terms of the results. 


“ The author stated that significant differences in regard to 
cognitive performances existed between the._three socio-economic levels. 
However, nothing is reportdd in the article that would substantiate 
this outcome. This also seems important for interpretation and 
replication of this study. \The author also stated that 90 percent of 
the SCIS children tested understood and therefore were able to respond 
to questions involving the térms "interact" and "evidence of inter- 
action."' However, no explanation of the basis for this generalization 
is given. It is not clear in\the article how the researcher kept 


track of student statements abbput interaction and evidence of inter- 
action. 


There appears to be no clear measure of attitudes toward the 
program experiences or test conditions. The motivational factor lacks 
clear definition. No significant ¥ifferences for this factor is con- 
tradictory to previous studies on SCIS program. There is no 
discussion of how the results of thi tudy enhance or-relate to mak- 
ing curriculum decisions as stated\ in ‘the purposes. 


s Finally, in terms of most educational research, the amount of 
variance accounted for was determined to be 35 percent” on the cognitive 


scores, which accounts for an appreciable portion of the variance, 
especially with such a large N. 
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Lawson, Anton E. "Sex Differences in Concrete and Formal Reasoning 
Ability as Measured by Manipulative Tasks and Written Tasks." 
Science Education, Vol. 59, No\ 3:397-405, 1975. 

Descriptors--Conservation (Concepts), *Critical Thinking, 
Educational Research, *Intellectual Development, Science 


Education, Secondary Education, *Secondary School Science, 
*Sex Differences 


Expanded Abstract and Analysis Prepared Especially for I.S.E. by 
Elizabeth K. Stage and Fletcher G. Watson, Harvard University. 


Purpose 


This study was to assess the Piagetian level of performance of males 
and females on tasks of concrete and formal reasoning abilities and to 
answer the following questions: 


1. Do males perform at a significantly higher (more formal) level 
than females on manipulative tasks of concrete and formal 
reasoning? ‘ 


2. Do females perform at a significantly higher. (more. formal) 
« level than males on a written examination of concrete and 
formal reasoning? 


3. Overall, do males and females demonstrate significant 
differences in their ability to reason formally? a 

4. Do the manipulative tasks and pencil and paper examinations 
seem to be measuring different psychological parameters 
depending on the sex of the examinee? 


" Rationale - 


Previous investigation shows that males' and females’ ability 
to demonstrate conservation reasoning may differ with respect to 
the format of the testing situation. They may also differ in ability 
to demonstrate formal reasoning depending upon the format of the 
testing situation, 


If significant differences are found by varying format, they 
should be taken into account when assessing the reasoning abilities 
of males and females, If different formats show one sex performing 
at a more formal level, then the question of why such a result is 
obtained should be raised. 
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Research Design and Procedure 


Sixty-two high school students (31 male, 31 female) were randomly 
chosen from a required high school biology course. Ages of the sub- 


jects were a mean of 15.2 for both sexes and a range of 14.4-17.5 for 
males and 14.2-17.1 for females. 


All subjects were given all instruments. 
1. Standard Piagetian manipulative tasks 


a. Bending rods - control of variables 
b. Balance equilibrium - inverse proportion 


Pencil and paper examination - adaptation of Longeot's test 
a. Proportionality (Early formal, IIIA) 


b. Propositional Logic (Fully formal, IIIB) 
c. Combinatorial analysis (Fully formal, IIIB) 


- 


3. Additional Measures 


a. Conservation of weight 
b. Volume displacement 


Findings 


1. Significant differences (p < .02) were found between the 


means of both manipulative tasks with males performing more 
formally than females. 


2.\ Significant differences (p< .10) were found only on the 
means of Propositional Logic written tasks with males per- 
forming more formally than females. 


3. Males conserved more than females on both weight and volume 
displacement tasks but the Chi square was significant (p < .001) 
for only the volume displacement task. 

4. 


A factor analysis was performed to assess whether the two 
manipulative tasks and the paper and pencil task were measur- 
ing the same psychological parameter. For the group as a 
whole, only one factor was extracted which accounted for 
48.7 percent of the variance. For males alone, only one 

- factor was extracted which accounted for 49.9 perdent of the 
variance. For females\alone, in contrast, two factors were 
extracted, one of which accounted for 48.2 percent and the 
other 20.5 percent of the variance., The pencil and paper 
measures loaded substantially on the first factor while the 
manipulative tasks loaded substantially on the second factor. 
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Interpretation 


\ 


* 
| The format of the testing situation differentially affected males' 


and females’ ability to demonstrate concrete and formal reasoning 
abilities, Males performed significantly better than females on manipu- 
lative reasoning tasks, Lawson ties this to the argument that males 

are, superior at cognitive restructuring tasks. 


The factor analysis \further indicated that format differentially 
affected males and female since only one factor emerged for males but 
two. for the females.” ° 


The sex of the experimenter may have had an effect since all of 
the testing was done by males, and other experimenters have found that 
this may yield superior male performance. > 


Males were also superior on conservation reasoning ability. 
Lawson ties this to the male cognitive style of field independence. 


ABSTRACTOR'S ANALYSIS 


Overall, Lawson's study seems well designed, carried out, and 
analyzed. The report is sufficiently detafled ‘to allow for replica- 
tion. The interpretation is the weakest component of the study 


A minor objection is his use of p < .10 as criterion for statistical 
significance in the Propositional Logic component of @#& pencil and paper 
task. If that were not used, there would be no statistically signifi- 
cant sex differences on the paper and pencil task. This treatment would 
yield\more support for his format differential argument and, more 
importantly, would mandate a more focused discussion of the relationship 
iia ‘aeaieac and the two factors on female performance. 


Tha major interpretive criticism, however, is more general. There 
seems to be a certain naivete or bias on the part of the author with’ 
respect/ to the sex differences literature. Both the cognitive réstruc- 
turing /and the field independence argument are ones which, although not 
discredited entirely by Maccoby and Jacklin (1), have been put in a 
more narrow perspective. Males have been found to be superior in visual- 
spatial abilities by the age of Lawson's subjects. Maccoby and Jacklin 
have concluded that "Boys do not excel at tasks that call for 'decon- 
textualization' or disembedding, except when the task is visual-spatial" 
(p. 350). In addition, "The sex difference in field independence is 
quite narrowly confined to visual-spatial tasks" (p. 105). Thus, if 
Maccoby and Jacklin are correct, cognitive restructuring and field 
independehce differences are derivative from visual-spatial differences. 
And thus, Lawson should compare his differences at the level of analysis 
of the degree of visual-spatial ability required by the format. This 
sort of interpretation is consistent with his much less striking aift- 
ferences on the paper and pencil task. 
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Also, the fact that the investigation focuses on sex differences 
should have been sufficient warning that he should have used both male 
and female examiners for both male and female subjects in order to rule 
out the potentially interfering effect. 
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Expanded Abstract and Analysis Prepared Especially for I.S.E. by 


John A, Easley, Jr. and Kenneth J. Travers, University of Illinois- 
Urbana. 
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. Purpose 


This series of five «tudies by Karplus and his associates at the 
University of California-herkeley, investigated the conceptual develop- 
ment of children in the ypper elementary grades and high school, and 
(in one study) of adults. Using the group test, the experimenters 
classified responses according to a hierarchical schema, with categories 
ranging from concrete to abstract levels of thought. The studies were 
intended to reveal information as to how well certain concepts, pri- 
marily that of ratio, function at various ages and to explore conditions 
under which learning of these concepts might be facilitated. Implica- 


tions of the findings for mathematics and science curricula were also 
discussed. 


Rationale 


The concepts of ratio and proportion are essential to an understand- 
ing of quantitative relations in science, as for example in many mathe- 
matical models. Hence, it is important that educators and researchers 
learn methods for determining the extent to which concepts, such as ratio, 
have been attained. Furthermore, curriculum developers ought to have at 
their disposal methods for identifying areas of need among students at 
various age and grade levels. The studies also sought to shed light on 
what tasks or techniques appear to promote concept acquisition and what 
learnings have accrued from various curricula. The possibility of cross- 
national comparisons of concept learning was also suggested. The use of 
group testing techniques (as opposed to clinical interviews) for assess- 
ing knowledge would make possible large-scale investigations of the ques- 
tions addressed by Karplus' research. 


The research appeared to have its point of departure in that of 
Inhelder and Piaget (4), which defines a stage of formal operations and 
links proportional thinking to this age. [However, these studies of 
intellectual development also relate to work by Lunzer and Pumfrey (8), 
Lovell and Butterworth (9), and Dienes (1). A review of these and other 
Studies purporting to deal with formal operations was presented by Lunzer 


(to appear) at the 1973 meeting of the Jean Piaget Society (see Abstractor's 
Analysis below).] 7 


Research Design and Procedure 

The distinguishing features of the five studies are summarized ‘in 
Table 1. Each study is subsequently referred to by the Roman numeral 
appearing in the title (e.g., "III" refers to Karplus and Karplus, 1972). 


The five studies had similar designs in that each consisted of the 
administration of a task, an analysis and classification of Ss' responses 


to the task, a tallying of the responses in the various categories, and 
a discussion of the resulting data. 
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4 TABLE 1 


Tasks Used in the Five Studies 


IV 


Paper Clips 
Fora B 
(15 min.) 


Group / 


Grades 4-9 
urban and 
suburban 
(N = 616) 


V 


*Paper Clips 
Form B (15 min.) 
®Candy (15 min.) 
**Ruler (2-4 min.) 
®éPulley (4-8 min.) 
“Workbook (20 min.) 
geometry 
arithmetic 


*Group 
**Individual 


Grades 7-8 
etudents 
suburban 
(W = 450) | 


“ 


Study : I Il III 
? : 

Name of task and Island Puzzle Paper Clips Paper Clips 
required comple- (10-15 min, ) Form A Form A 
tion time . (15 min.) (15 min.) 

> 
How administered Group Group Group 
Subjecte used Six groups Six groups Grade 6, 8, and 
(N = 449) (N = 727) 11 suburban 
from grade 5. from grades students test- 
through adult 4-12, urban ed in study II- 
oe (teachers of and suburban as grade 4, 6, 
7 science) and 9 
. (N= 155) 
plus new group 
of grade 8 
students 
(NW = 141) 
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ief Description of Tasks Used 


Island Puzzle: This puzzle involved four islands in an ocean 
and the possibilities of travel between them. The task was 


designed to assess abstract reasoning ability. Individual 
responses were written. 


Paper Clips, Form A: Ss each were given a sheet’of paper on 
which a stick figure 7 9/16 inches tall was drawn, together 
with a chain of from seven to ten No. 1 "Gem" paper clips. Ss 


were shown-a display chart with the same figure as they had on 
their sheets, with a scaled up version on the back side of the 
chart. The experimenter had a chain of eight jumbo paper clips. 
Ss were shown that the small figure was four "biggies" (jumbo 
paper clips) tall and the large figure was six "biggies" tall. 
Ss were to measure the small figure on their sheets with "smallies" 
(small paper clips) and to predict the tall figure's height in 
“smallies." Ss wrote their responses. < 

f I 
Paper Clips, Form B: This was similar to Form A, with the 
important difference that Ss did not ever see the tall figure. 
Ss were told that the small figure was four buttons tall and 
the large figure was six buttons tall. They were to measure 


the small figure in paper clips and predict the height of the 
large figure in paper clips. 


Candy: Ss were given written information concerning a person 
who had many bags of two kinds of candy, each bag containing 
a certain number of reds and a different certain number of 
yellows. All bags were alike. Ss were given written informa- 
tion relevant to predicting the number of candies taken from 
the bags. One task (a) involved application of the ratio 5:3; 
the other (b) involved the ratio 2:1. 


Ruler: An individually administered task was used, involving 
an unmarked rod and a ruler marked in both inches and centi- 
Meters. Ss were shown that a displacement of 2 inches on the 
rod was equivalent to a displacement of about 5 cm. Ss were 
then asked how many centimeters would be equivalent to a dis- 
placement of 8 inches. 


Pulley: Individuals were interviewed using a mechanism -which 
involved two pulleys (diameter ratio 3:2) fixed to same shaft, 
% string attached to each pulley, and a meter stick. S and 
experimenter worked together on the mechanism and S was shown 
that a displacement of 10 cm. on his or her BETINS corresponded 
to a displacement of 15 cm. on the experimenter’ s string. S 
was then asked how far the experimenter's string would move 
when S's string was displaced 6 cm. and to explain wi 


Workbook: Two tasks were designed to be similar/to those 
found in textbooks. One geametrical task was thtended to 
assess the S's ability to retognize a fraction of a whole and 
to represent this faction pictorially and numerically. The 
other task, numerical, required the S to apply proportional 
reasoning with no circumstantial clues. 


. 
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Categories Used to Analyze Tasks 
Study I: 


NN. (no explanation) 

I (prelogical) 

Ila (tyansition to concrete) 

IIb (concrete) 

IIIa (transition to abstract logic) 
IIIb (abstract logic) 


Studies LII-V: 


N (no explanation or statement given) 

I (intuition): estimates, guesses without reference 
to data 

Ic (intuitive ctmoutactauls data used haphazardly or 
illogically 

A (addition): uses difference rather than ratio 

Ss (scaling): uses change of scale, not related to 


scale inherent in data 
AS (addition and scaling): use of difference and scading 


IP (incomplete proportion): uses one ratio only 
P (proportional! reasoning): uses properties of pro- : 
portionality ‘ 


3 subcategories of P: 
PC (proportion, concrete) 
AP (addition and proportion) 
R (application of ratio) . 


Findings 


All five studies revealed a tendency for the mediansfrequency of 
tasks to move from the lower level categories (such as I and S) to the 
higher categories (such as P) as the grade level of the respondents 
increased. Wide fluctuations in performance within grade levels were 
found in II. In one urban class, 90% of the responses were placed in 
categories N, I, or IC, while in another class, categories AS and P were 
more numergus than in any one suburban class. In III, a longitudinal 
design prpduced data suggesting that the categories may be representa- 
tive of developmental levels. Of the 153 students involved in the two- 
year study, 28% moved into P or AS, while only one student moved out 
from P to A. At the lower levels, 7% moved from other categories into 
I or IC, while during the same two-year period, 65% moved out. It was 
also found, however, that 40% of the students remained in their cate- r 
gories for both testings. ‘ 


Table II, which combines the categories into three levels, sum- a 
marizes the classification of the Ss in study III over 2 years (1969 
and 1971). a 
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TABLE II 


Matrix Comparing Students in 1969 and 1971 by Levels 


(Number of Students) 


‘ \ 


1971 \ 
\ 
1969 Level I Level II Level ILI Toaal 
Level I = I + IC 19 $92 10 } “SL 
Es 
Level Il =S+A 7 45 26 | 78 
/ 
Level III = AS + P 0 1 23 24 
> 
Total 26 68 
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(From Karplus and Karplus, 1972, p. 739) i 


Study LV used w form of the Paper Clips Task. It was more 
abstract than Form A, that the S did nop ever see the figure whose 
height was to be predicted\using a proportion. A dramatic reductio 
was found in the number of Ss' (scaling) responses to Form A (30%) 1 
when Form B (4%) was used, with a corresponding increase in categories 
Ic, IP, A, and P. The latter categories require, the researchers assert, 
* conceptual processing of the data by the Ss (IV, p. 480). , 

The variety of tasks used in V proved to be interesting. Results 
on Form B of the Paper Clips Task were similar to those obtained by the‘ 
8th graders in IV. Sex differences in responses were not appreciable. 

It was found that the value of the ratio used is a factor to be taken 

into account. In the Candy Task, application of an integral ratio 

(2:1) was interpreted as not indigative of formal reasoning, since use 

of the ratio did not correlate wif} proportional reasoning in a more 
complicated task (V, pp. 597-598). ‘The Ruler Tagk was found to be easy 
for the junior high school students, since 87% résponded with proportional 
reasoning. The Pulley Task was more difficult and the geometrical and 
numerical items were most abstract (V, p. 604). It was noted that a 
perfect score on the numerical items was a good predictor of success 

on the geometric items, while the reverse did not hold. 


® 
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Interpretations . 

The researchers, overall, were disturbed by the implications of 
their findings. In study I, intellectual development, as assessed by 
their taxonomy, reached a "disappointingly low level" in the high school 
age group and did not progress much further (I, p. 403). In study II, 
it was found that successful preportional reasoning was not reached until 
the last years in high school. This concern was reiterated in III, where 
evidence was fotnd that many students did not advance to more abstract 
categories of thought during the intervening two years.of that longi- 
tudinal study. Another disturbing implication, prompted by the data of 
studies II and III, was evidence of apparent obstacles to learning which 
may be inadvertently set up by "mathematics courses, by teachers, and 
by the children's cultural environment" (II, p. 817). Of particular 
nottce was the dramatic contrast between the responses of urban and 
suburban llth and 12th grade students. It was found that 80% of the 


suburban students, but only 9% of the urban students,-were classified 
at the highest level P. 


The findings of study IV placed emphasis on the context in which 
the problem was presented. Since Form B of the Paper Clips Task was more 
abstract than Form A, the researchers concluded that Form B cempelled the 
students to make use of the data. Indeed, they concluded, Categories I 
and S under Form A (studies II and III) may reveal an attitude toward 
handling of the data rather ‘than the respondent's i aa level of 
competence (IV, p. 480). 


The variety of tasks used in V led the researchers to generalize c 
about the influence of a task upon the S's response. Tasks tending toward 
concreteness (Ruler, Paper Clips) led to pere correct responses than did , 
the abstract tasks. The lack of applicability of proportional reasoning 
to physical relationships raised questions about the appropriateness of 
many instructional strategies, particularly at the junior high school 
level, where only about 15% of the subjects were found to have reached 
the highest level. The researchers speculated that one source of the 
problem may be that ratios are introduced as fractions and proportions 
as equivalent fractions. "Curricula make little effort to interpret 
ratio, proportion, and the related division process in terms of .. . 
gt any of measurements. In this use of division, the concept . 
of remainde¥ has no place" (V, p. 610). 


: rY 
ABSTRACTOR'S ANALYSIS 


It should be clearly noted that Karplus and his colleagues are 
investigating a different problem from that studied by Piaget. It is 
easy to lump these together with all kinds of studies involving tasks 
that can be called "formal reasoning," by the ordinary meaning’of that 
term. Lunzer (7), for example, made no notice of the peculiar use 
Piaget and the Geneva School make of that term, but made a broad survey 
of problem-solying studies in which logic or other formal rules or pro- 
cedures were had end made it the basis for a critique of Piaget's. 
theory of the stage of formal operations.. Karplus and his’ collaborators 
are careful not to identify what they call proportional reasoning in . 
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studies II-V and what they call "abstract logic" in study I, with 
Piaget's Formal Operations. At least, they are quite open to the possi- 
bility that these may turn out to be different things. The situation 

is reminiscent of one involving the notion of relative motion. 


When Piaget argued that cndenieanding relative motion required the 
INRC group, and hence was a formal level task, Easley (2) attempted to 
provide an operational definition of the INRC group in the snail board 
problem Piaget had used in his studies of relative motion. However, 
Piaget made it quite clear that young children who could perform in a 
way that satisfied that operational definition would not thereby auto- 
matically be credited with having achieved the formal level (personal 
communication). Here in the studies under review, we have evidence 
from high school seniors (and even many adults) who do not employ 
"abstract logic" or "proportional reasoning" in these problems, but we 
are not entitled to infer that they have not achieved the stage of formal 
operations, as Piaget defines it, even though Inhelder and Piaget state 
that proportional reasoning is only attained at the stage of formal 
operations. 


Piaget's theory, briefly, is that four cognitive structures called 
logical operations (identity, negation, reciprocity, and correlation), 
usually between the ages of 10 and 14, unite to form a single structure, 
the INRC group. This greatly increases the power of the individual over 
the intuitive feelings of physical quantity operations. Thus, Piaget 
writes: 


It seems evident, in the instance of weight, that the 
difficulties of dynamic interpretation presented by this motion 
play a big role in the delay of its operational structuration, 
because of the contradictions that must be overcome between the 
demands of structuration and the diversity of objective causal 
situations. The same applies to volume, the delayed logicali- 
zation of which seems to be linked to geometric problems of 
internal continuum. . . going beyond the realm of concrete 
operation (11:3). 


Second, the group composition of the individual operations now permits 
operations on operations, which supports the development of proportion- 
ality. Thus Piaget writes: 


But we have also seen how the subject succeeds (first) 
in constructing by reflexive abstraction his multiplicative 
+ operations as additive operations at the second power, then 
his structures of proportionality by equalization of rela- 
tionships (therefore, again, by relationships of relation- 
ships or relationships at the second power) . . . (11:67). 
5) 


To discover whether a subject has attained the stage of formal 
operations for proportionality operations in a given context, Piaget 
and his colleagues employ the clinical interview with probing (at 
times, prompting the subject first in one direction and then in another 
to separate any efforts to*please the interviewer from what the subject 
genuinely believes) and continuing the probing until satisfied that the 
most advanced level of which the subject is capable has been demonstrated. 


They thereby incur the criticism of behaviorists -that they are leading 
and prompting their subjects. However, Piaget argued (10) that without 
such methods they cannot discover intellectual structures. 


On the other hand, Karplus and his colleagues are interested in 
the primary or spontaneous level of thought (defined in terms of their 
categories) that subjects employ when solving a paper-and-pencil test. 
Although they state that they have used interviews to check the levels 
of performance they get on paper-and-pencil tests, it is clear that they 
have avoided the clinical interview with its probing and prompting. This 
difference in purpose and procedure explains in part the differences in 
age distribution found between the two groups of studies. A second 
major contribution to these differences, which the research to date can- 
not isolate from the first, is the phenomenon Piaget calls decalage 
(separation or displacement). This refers to the delay in development 
from the first case in which a subject can use a given form of thought 
(in this case, formal operations) to a more difficult application that 
is different in content. While Piaget's theory of decalage is not very 
well developed, the essay quoted above [Piaget (11)] indicates that the 
developmental relationships between various kinesthetic structures and 
logical ones is complex. Another difference is that Karplus et al. 
(study V) employ a concept of types of reasoning which depends on the 
external static form of arguments. This contrasts sharply with Piaget's 
interest in internal dynamic processes [see Easley (3)]. 


The gulf between the narrow conceptual and methodological traditions 
that characterize these two groups of studies has not yet been bridged. 
Both groups have legitimate research interests and both have practical 
applications to education. It is interesting, for example, that the 
Elementary Science Study and the Science Curriculum Improvement Study, 
the latter of which is directed by Karplus, have generally been inter- 
preted as attempting to challenge the highest intellectual competence. 
of children and not merely tap their typical performance in a situation. 
It might seem then that Piaget's approach to research on cognitive 
development could be more relevant to Karplus' elementary school project 
than Karplus’ own research approach is, or perhaps that, in preparation 
for the development of new secondary school curricula, a more external 
form is thought to be required. Perhaps there is another reason we do 
not understand for the maintenance of this separation. 


The issues raised by this series of studies are important ones. 
Other mathematics educators will surely share the investigators’ con- 
cerns over the appropriateness of the current curriculum (and its 
typical method of implementation) for developing applications of mathe- 
matics to real-world problems. If it can be agreed that the mathe-_ 
matics curriculum should indeed promote creative use of numbers to 
describe physical objects (as is required in the sciences), then a 
research base fdr curriculum development in integrated mathematics 
and science programs is an important emphasis for future study. Also, 
despite the difference between Piagetian clinical interviews and 
Karplus’ paper-and-pencil tasks, both offer alternatives to the ‘con- 
ventional fascination with standardized (group-normed) tests of 
aptitude or achievement. 
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Karplus and his associates also raise the matter of cross-national 
comparisons of educational programs through an examination of intellec- 
tual develomment. Such a topic would appear to lend itself readily to 
consideration for inclusion in the proposed second round of school sub- 
ject surveys by the International Association for the Evaluation of 
Educational Achievement. Infdrmation is needed on the relationship 
bttween different curricular emphases found in various countries and the 
attendant differences in conceptual development. Might it be the case 
that some countries are indeed much more successful than others in 
promoting the attainment of proportional reasoning for the majority of 
students during the junior high school or early high school years? In 
terms of Piagetian methods, recent suggestions that the rate of intel- 
lectual development might be uniform across cultures [Kamara and Easley 


(5)] is not well supported beyond the stage of concrete operations. 
Further research on this front is needed also. 
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